
SEQ ID N0:1 





j. 


ccc a a r pr~ , rr ir P 


/-i /-■» AAA mmp a m 

GGAAA I I GAT 


AGAATTTTGT 


GTGAGGTGCG 


TAGCGGCTCT 








G 1 GGGGGGAG 


ATGTCTGGTC 


TCAGGTAGGG 


CGACAATGGA 




-L U X 


o a r , P r nr ,r P r PA c* 

KdPvoKd lol 1 AG 


T"TPPPPPPmP 

I -LGGGGGGTG 


TATCGCTCTC 


TGCGTGGCGC 


ATTG GGTC AT 




1 R1 


GG 1 GGGGGGA 


/"^ a m a m/~i a m a rn 

GATATGATAT 


TCCGCTAGAG 


GATTACTGAT 


AGTTTCTGCC 






lb i GGGGG I I 


GTCGGGCTTG 


TCGGGCTTGT 


CGGGCTTGTC 


GGGCCTGTCC 






LiLl 1 G 1 GGG 


GCCTGTCCTC 


ACTTTTTCAC 


aatcaaaaaa 


TGGGCGAAGC 






LL I 1 GTTGTT 


CTATAGTTCT 


TATAGTTCAT 


ACGAAAATTA 


CACATAATTA 




TCI 


rp/^' A "A m a f\ mm 

1 LAA i AGGTT 


7\ m m z*^ ^% m m tv. ta 

ATTCGCTTAA 


AAGGGAGTAA 


TTGGGCCGCA 


AAAGGGAGTA 




41 UX 


Al IGGGCCGC 


AAAAGGGAGT 


AATTGGGCCG 


CAAAAGGGAG 


TAATTGGGCC 




yl CI 

<± j X 


bA I ATGGGTT 


GTTTACATGG 


GGAGGAATCC 


CCTTAATCAT 


TTCTCCCCAT 




DUX 


ppp7\ a A Z" 1 A /""< A 

GGGAAAGACA 


ACACAAGTGG 


CCGCAGACCG 


GGCCTTCGAC 


CAGACAAAAA 




JjI 


Lib! bt I GGG 


m ft tv /^i /~i m /*-< 

TGCCGAGGTG 


GCGAGAGGGG 


TCTATATGCG 


CAATCCGCCC 


- ; * 

1 sf 

iU 

HI 


DUX 


GGGG 1 GGAGG 


CGCTCAAGCT 


CATGCATTTA 


ATGATAGCCA 


CTGCGGGCGG 


DjX 


GGGGA 1 GGGT 


GATGATGTGC 


GCCATGAAAT 


GCGGCTGGCC 


GACATTCGCG 




/ VJ 1 


LAA 1 L.VjtAL,IjG 


A m/" > < A A A A A /""I 

G A i GAAAAAC 


CATGACCGTG 


AGAGCCTGAC 


CCCGCTGTTC 


Fi! 

e ;p 
f ! ^ 


7 R 1 
/ jl 


vjjA^jVjAGG 1 AG 


/"* f** o m /~* r~* m 

GGGGTGCGGT 


GTTGACCCAT 


GATGACCCTG 


CAAAGATGAT 


cn 


O U 1 


GG 1 GAG AG I G 


GGCGGCTTGG 


TCGATGAGGC 


GC GAAT AG AC 


TACCGCCAGG 


OC1 
O jl 


AGGGAAGGGG 


G G AAG TC C T A 


GTGACGTGGA 


CCTTCCGGAG 


TACATTCCGT 




J U X 


GG1 A I GGGGG 


CGGAGTCGAA 


CCACTGGGCC 


ATTCTCGACC 


GTCAAACGGT 


c:i 


y Di 


AI 1GGATGTG 


GGTAGTAAGT 


ATTCCGTGCT 


GCTGTTCCAG 


CACGTCTCTA 




X U U X 


GIGTGGCCAA 


TCTTGATCGG 


ATGAGCGCGA 


AAACCTTTAC 


GGTCCCCGAG 


c:i 


X U j X 


1 1GGGGGCGC 


TCCTTGGAGT 


GCCCGAGGGA 


AAGATGGTTC 


GTTGGAACGA 




X x ux 


mm a a /~t a /"i "a 

G GTT AAC AG A 


TTTGCTCTCA 


AACCTGCACT 


GGATGAGATC 


AACCATTTAT 




1 1 CI 

XX DX 


GGGGTCTGAC 


ATTGACGGCA 


AAGCCGACCA 


AGATTGGCCG 


TAGCGTGGCA 




x z u x 


A prnp m/^ 1 a /"tin a 

AG I G 1 GAGTA 


T AGGC TGGGA 


AGTGAAAGAC 


GACCCAACCG 


TCGCCAGGCG 




X Zj X 


GGAGG TGGCG 


GGTTCCAAGG 


TCGGTCGAGA 


TGCTCGTCGC 


AGAGGGGCAG 




X j U X 


G GGAAAG GAT 


AGCCCCCTCC 


TTCCCAGAAG 


CGGGCGGGAT 


CACCTACAGT 




1 TCI 

1j J X 


r^c* a ppmmppp 

GGAGG I TGGC 


m /~i -t\ /^i m t\ t\ 

TGGAGCTGAA 


ACGCTCTGCT 


GGCAGCAACA 


AGGACAACGA 




1 4t U X 


1 G I GA I GGGG 


TCAGACTTCC 


GGCGTTTCTG 


TCGGGAGAGA 


GGCGTGCGTC 




X *£ j X 


1 GGAGGG I GG 


"t\ Tv tv #n T\ rr^ /"i tv -* 

AAACATCGAA 


AAACTGTTTT 


TAGATTTCTG 


CGCAAAGGTA 




i cm 

1 Jul 


GGGAAGGTTT 


t\ yTrj m m m m tv 

GAGTTTTGAG 


GTATTTC AC C 


GCAATAGTGT 


TAAATGACTT 




1 CC1 
1 J Jl 


i GG 1 GAAAGG 


ATGTGCAATA 


TAGC GGTAAG 


ACTATGAAAT 


ACACGGCTGG 




X D UX 


AGAGGCTGCA 


TV TV TV /^i TV TV y— 1 m 

AAAGCAACGG 


GTGTGGCGAC 


CGCAACCATC 


ACTCGGGCGC 




1651 


TAAAAAGCGG 


TAAAATTTCC 


GGTAAAAAAG 


ATGAATCTGG 


GGCATGGGTT 




1701 


ATAGATCCTG 


CAGAATTGCA 


CAGAGTGTTT 


CCTCCCATTT 


CAAAGAAATA 




1751 


CACCGAAACA 


CCTAACACGC 


AAGTATATGG 


TAAGCGTGAT 


GAAACACATG 




1801 


AAATGACCTC 


AGAAATCAGC 


GCATTAGAGC 


GTGAAGTTCG 


GACTTTACGC 




1851 


GATGCTTTAT 


CTGATGCCAG 


GGAGGATCGC 


GACAAATGGC 


GCGACATGGC 




1901 


CGAGCGTCTT 


TCAATTTCAT 


CACCGATGAG 


AGAGGAAGAC 


CGCCCCCCTC 




1951 


AAAAACAAAG 


ATGGTGGAAG 


ATATTCTGAT 


CCTGGGCTTC 


AGGAGCCTTG 



FIGURE 1A 



2001 CCTTTACTGG CGGAAAAACG CGATATTGAG GCACAGGCCC GCACTTTAGA 
2051 GGCGGAAGCC TATAACGAGT AC C AAAAC AC TAGAAGCCAG ATTGAGGAAA 
2101 ATAGGGAACG TG 
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SEQ ID NO: 2 



1 

1 


TGGtGAACGC 


ATTGGCTTGA 


TGTTTGAGAA 


AAGCGAAAAG 


ACCCGGCCAC 


51 


AGTTGTGGGT 


AGAGCGTCGA 


TATGTGCAAG 


ACCTGATGCT 


TGCTGACATC 


101 


GAACTCCGTG 


TCTACCTCGC 


ATCGTCGCTG 


TATCAGCCTG 


CTGCGGATGG 


151 


CGGAAAGCCC 


GCCTATGGTC 


GTCACGCAGC 


CCTTAAGGCG 


ATGCGCGACT 


201 


TGGCCCATGC 


CGATCTGGTG 


CGTTTCACCA 


TCGGCCGGAT 


TACGCAACTG 


251 


GAGATGATCC 


TAGAGCGGTT 


AACCGAGACA 


TCTGGTTAAC 


GCCATAAAGG 


301 


CTGCGGCATG 


AAAATAGGCG 


GACAATCTGC 


GCTTGGCCGC 


CCCCGTTCTC 


351 


AGCCGTGCTT 


GCTCTCTGCC 


TGCATGGCAC 


GACGCAGGAT 


CGCGTTCATA 


401 


CGGGTCTGAT 


ATCCAGACCC 


GCCCGCCTTG 


AGCCATGCCA 


GCACATCGGC 


451 


ATCAAGCCGC 


GCGGTGATCT 


GCTGCTTGAT 


CGGGCGATAG 


AAGCGCCCAC 


501 


GCTCGGCGTC 


TGCCCATTGG 


GCTTCGGTCA 


GCTCGGGAAC 


ATCGTTGGTG 


551 


TCGATCTGCT 


CGGGCGGCAG 


AGCGTCCAGC 


CGCGCCAATT 


TCTTGCGGCG 


601 


CTCCTCGGTA 


AGAGCGGGCA 


GCGTATC GAA 


GGTGTATTCA 


ACCATTGGCA 


651 


TATCTCTTCC 


TTTCCTGCGG 


TGTAGCGCGG 


CGAGCCGAAA 


TGATGCGGAT 


701 


CGTCTCGACC 


GGATCGGGGC 


CAGCCTCGAT 


GATCAGGTGG 


GCAACCAGAA 


751 


GGACGGCAGC 


GCCATAGATC 


TGCCCAACGG 


TTTGCCAGCG 


GTATTCCCCG 


801 


CCCTCGATCC 


TATCCTGAAC 


CGTCAGGTGC 


AACGGATCGG 


CGAACACATG 


851 


CACAGCATCC 


TCGAACCGGA 


TGCCATGCTT 


CTTTTCGTTC 


GTTTCCGCCT 


901 


TGGCGGGATC 


CCAGATAAAC 


CGCATCTTCA 


TGGCAGAATT 


ATAACTACAC 


951 


ATTTGTAGTT 


ATTCAATGGC 


AAGTCGCAGG 


TTCAAATCAC 


GCCCCCAAAC 


1001 


CGCAACTGTA 


TTCGTTCTAC 


TCACGCGCGC 


TTTTGAATAG 


AAGC TTGC AT 


1051 


GATAACACCC 


GCCGCGTCCT 


CAACAAAATA 


AGGCAAATCC 


GCCGCGCTGG 


1101 


CGCAATCTGC 


GCTTTGTCGA 


TGCAAGGTCT 


TGTGGTTTCA 


TACTGCAAGA 


1151 


GCATGCAAGG 


AATTGCCCCG 


GATGAGCACC 


AC GACGAC AC 


CCACCAAGCC 


1201 


GGCCTGGAAC 


AAGGGCCGCG 


TTGTCGGGAA 


AAAGCCGCCG 


CTGACACCTG 


1251 


ACCAGATTGC 


CCTGATCCGT 


CTCATCCTGC 


GCCAGGAACG 


GGCGTGGCGG 


1301 


GATCTGGCTC 


TGTTCAACGT 


GGCGATCGAC 


ACCAGTTTGC 


GCGGCTCGGA 


1351 


CCTCGTGCGC 


CTGCGCGTCT 


CGGATGTGGC 


GACCCCAGCT 


GGTCTGCGTG 


1401 


AGATCGTCGA 


GATCCGCCAG 


AAGAAGACCG 


AGGCCCGCAA 


TGTCCGCCCC 


1451 


GTACAGGCCC 


GCCTGTCGGA 


GGGGACACGC 


GAGAGCC TGC 


GGGTCTATCT 


1501 


CGCGGCCTCT 


GACAAGCCGC 


TGCACAGCTG 


GCTGTTCACC 


GGACAGGGCA 


1551 


TCCGCTGGTC 


CCACACCCAC 


CTTAGCGAGA 


GCCAGCTGTG 


GCGCCTGTTC 


1601 


AAGTCC TGGC 


TCGAGAAGGC 


GCGGCTCGAT 


CCCAGCCTCT 


ACGGGCTGCA 


1651 


CTCGCTGCGC 


CGAACCTTCC 


CCAGCCACAT 


CTACCGCGAG 


ACCGGCAATC 


1701 


TGCGCGCCGC 


ACAGCTGCTG 


CTGGGCCATG 


CCAGCATCGA 


GAGCACCAAG 


1751 


GAGTACATCG 


GCACCGAGCA 


AGCCGAGGCC 


CTC GATATCG 


CACGGAGGTA 


1801 


TCACCTCTAA 


CCCATGGAGA 


CCTATCTCGA 


GAAGCGCATC 


CCCGCCAAGA 


1851 


ACACAGCACG 


GTTCTACCGC 


ATGGCGGTCC 


TGCCGAACCT 


GTTCGGGGAA 


1901 


TGGACGCTGT 


ATCGAGAATG 


GGGCCGCATT 


GGCATCAGCG 


GCCGCATCCG 


1951 


GCTCGATTGG 


TTTGAGAGTG 


AACAAGATGC 


CATCGCTGCG 


ATGCTCGCCA 



FIGURE 2A 




2001 TCGAGACCGC CAAGCGTCAG CGCGGGTATT GGCTCGAGCC CATCCAGATT 
2 051 GACATGTTCC CAGGGGCATA ACAGGCCATC AATGTAAGAG TGCAAGCGGA 
2101 GCAAGCAAAA GCCATTTCAC AGTGAGGTGG CAGATGTTCC TGTTTCACAG 
2151 TGAAAGCGCT GATGCTGTTT CCACGCCACA GACTGATACG ACCAAAGCAA 
2201 CGGGGTCTGC CGCCACAGAC CGGTTCGCCG GCCACCCGCA GAAACGCAGG 
2251 TAAAATGGCG ATTTCCGCAA AAAAACCGTG CAAATGATGG CAAATCACCA 
23 01 TCCAGTTTCA TCCTGAAACC CGTCGCTCAA CATGAACGAG CAGGCCATCA 
'2351 TCCAAGCCCC AGAAACGCGG TGCGGCGACT ACAGATGAGC GATGTTCTGG 
2401 CTCATAGGCT GCAAGGCCCT GCAACAGTGA TTTCACCGTG AGATTGCAGG 
2451 GTCTTTTGGC TCTCCCGCAA GAGCCACCTC AGGGTGAGCG AGCTAGCCGT 
2501 CTAGGTTCAC AGTGAAATCG CTGAGGAGCG TTGCGGGGCT TATGGTTTGG 
2551 CTGGTCACGT TGGCCATCGG AATGGAGCAT ACGATGGCTT CTACGCAGTC 
2601 GAATCCTGAG GCTTCACGTG GGAAAAATAC GCTCCAAAAA AGCCCTGACC 

2 651 AAATCTTGGA AAAATTGCTT GAAAAGTTTG CTTCTAAAAA ACTGGGAACG 
2701 AGATATGCAC GAGATCCCTT ACGAGTGCTG TAGGAGTAAT GCAGTGGACA 
2751 AAAACGCCAT TTTTTGCCCC AGTAGGAGTA ATGGAGTGGT TATTTTTTGG 
2801 GAGATTTTGC TTCAGTAGGA GTAACGCGTT GjGTTAAATTT GCTTGATTGG 
2851 CGGTTCAAAT CGACCACCGA GCTGCCGTTG GTCGTATTCG ATCTGCCCCG 
2901 CAATTGGGCA CTTGCAGGCC ATCCCCCTGA ACTTCTGGCG ATGACCATTT 
2951 CGAAGGCAAT GGGTCGAAAT TCATAGAATT TTGTGTGAGG TGCGTAGCGG 

3 001 CTCTGACAGG GGTGCTGCGC GGAGATCTCT GGTCTCAGGT AGGGCGACAA 
3 051 TGGAGAGGTG TTAGTTGCCC CCTGTATCGC TCTCTGCGTG GCGCATTGGG 
3101 TCATCCTGCC CGGACATATG ATATTCCGCT AGAGGATTAC TGATAGTTTC 
3151 TGCCTGTCGG GCTTGTCGGG CTTGTCGGGC TTGTCGGGCT TGTCGGGCCT 

32 01 GTCCCTCTTG TCCCGCCTGT CCTCACTTTT TCACAATCAA AAAATGGGCG 
3251 AAGCCCTTCT TGTTCTATAG TTCTTATAGT TCATACGAAA ATTACACATA 

33 01 ATTATCAATA GCTTATTCGC TTAAAAGGGA GTAATTGGGC CGCAAAAGGG 
33 51 AGTAATTGGG CCGCAAAAGG GAGTAATTGG GCCGCAAAAG GGAGTAATTG 
3401 GGCCGATATC GGTTGTTTAC ATGGGGAGGA ATCCCCTTAA TCATTTCTCC 
3451 CCATGGGAAA GACAACACAA GTGGCCGCAG ACCGGGCCTT CGACCAGACA 
3501 AAAACTGTGC TCCCTGCCGA GGTGGCGAGA GGGGTCTATA TGCGCAATCC 
3 551 GCCCCGCCTG CAGGCGCTCA AGCTCATGCA TTTAATGATA GCCACTGCGG 
3 601 GCGGCCGCAT GGCTGATGAT GTGCGCCATG AAATGCGGCT GGCCGACATT 
3 651 CGCGCAATCG ACGGCATGAA AAACCATGAC CGTGAGAGCC TGACCCCGCT 
3701 GTTCGAGGAG CTAGCCGCTG CGGTGTTGAC CCATGATGAC CCTGCAAAGA 
3 751 TGATCGTGAC AGTCGGCGGC TTGGTC GATG AGGCGCGAAT AGACTACCGC 
3 801 CAGGAGGCAA GCGGCGAACT CCTAGTGACG TGGACCTTC C GGAGTACATT 
3 851 CCGTCGTATG GCGGCGGAGT CGAACCACTG GGCCATTCTC GACCGTCAAA 
3 901 CGGTATTCCA TCTCGGTAGT AAGTATTCCG TGCTGCTGTT CCAGCACGTC 
3 951 TCTAGTCTCG CCAATCTTGA TCGGATGAGC GCGAAAACCT TTACGGTCCC 
4001 CGAGTTGCGG GCGCTCCTTG GAGTGCCCGA GGGAAAGATG GTTCGTTGGA 
4051 ACGACGTTAA CAGATTTGCT CTCAAACCTG CACTGGATGA GATCAACCAT 

FIGURE 2B 



• • 

4101 TTATCGCGTC TGACATTGAC GGCAAAGCCG ACCAAGATTG GCCGTAGCGT 
4151 GGCAAGTGTG ACTATAGGCT GGGAAGTGAA AGACGACCCA ACCGTCGCCA 
4201 GGCGCGAGCT GGCGGGTTCC AAGGTCGGTC GAGATGCTCG TCGCAGAGGG 
4251 GCAGCGGAAA CGATAGCCCC CTCCTTCCCA GAAGCGGGCG GGATCACCTA 
43 01 CAGTCCACGT TGGCTGGAGC TGAAACGCTC TGCTGGCAGC AACAAGGACA 
43 51 ACGATCTGAT CGCCTCAGAC TTCCGGCGTT TCTGTCGGGA GAGAGGCGTG 
4401 CGTCTGGACG CTGCAAACAT CGAAAAACTG TTTTTAGATT TCTGCGCAAA 
4451 GGTAGGGAAG GTTTGAGTTT TGAGGTATTT CACCGCAATA GTGTTAAATG 
4501 ACTTTCGTGA AACGATGTGC AATATAGCGG TAAGACTATG AAATACACGG 
4551 CTGGACAGGC TGCAAAAGCA ACGGGTGTGG CGACCGCAAC CATCACTCGG 
4601 GCGCTAAAAA GCGGTAAAAT TTCCGGTAAA AAAGATGAAT CTGGGGCATG 
4651 GGTTATAGAT CCTGCAGAAT TGCACAGAGT GTTTCCTCCC ATTTCAAAGA 
4701 AATACACCGA AACACCTAAC ACGCAAGTAT ATGGTAAGCG TGATGAAACA 
Q 4751 CATGAAATGA CCTCAGAAAT CAGCGCATTA GAGCGTGAAG TTCGGACTTT 

^ 4801 ACGCGATGCT TTATCTGATG CCAGGGAGGA TCGCGACAAA TGGCGCGACA 

fij 4851 TGGCCGAGCG TCTTTCAATT TCATCACCGA TGAGAGAGGA AGACCGCCCC 

CfJ 4901 CCTCAAAAAC AAAGATGGTG GAAGATATTC TGATCCTGGG CTTCAGGAGC 

J If 4951 CTTGCCTTTA AAACCTGAAT CAGCATTCTA GCGATGCTGA TAAGAAGTAA 

£f| 5001 ATATAGCCAC AATAGAGCGG CCATTTTCCA TTCACATACA GCTCATCATG 

ff 5051 TGATCAATAT CAAGTATTGA TATTCATCAA TGGAGAAGAA TTTACATGTA 

O 5101 TCACAGGATC ATCACAGCAT TTGTTTTTGT ATTTCTAAGT GCTAACATAA 

fl\ 5151 CTATCGCTGG CCCTAAAGAA GATTGTACTA TTGCAGTATC TCACCTTGGG 

|f! 5201 TTTCAGACCG ATAATTACAG CTTTGTCGAA GCCGGTTTTT TTGCCAGAGA 

p 52 51 GAGACACGTT TTTGATGGTG TAATAAACTG CTACGTATCT CATGATGGTA 

^ 5301 ACATACACAG CATCATCCGG GGCAACACAC CTCTTATGGA AGATGGATAT 

53 51 TATGGCCCAG AAGTACTGGC GGAAAAACGC GATATTGAGG CACAGGCCCG 
5401 CACTTTAGAG GCGGAAGCCT ATAACGAGTA CCAAAACACT AGAAGCCAGA 
5451^ TTGAGGAAAA TAGGGAACGT GCCCTCGAGG CGCTGCGGCT AGCTAGCAGT 
5501 CCTTTTATTA ATAATGGTAG TACAGAAGAA CAGACAATTA TACAGGCCGC 
5551 AACTCCGACG GCAGATCCTG TTGTATCTGT ACCCGTGGCA TCTCCAGAAT 
5601 CTAAACAAAG TCGAGAGCCG GAACCGGCTG CTGTTCCAGC ATCAGTTTCT 
5651 GTTAGAGAGA TGTGGAGCAC GGCTGACAGA TTGACCACCC GTACATGCCC 
5701 ATCGACTCGA TGCGGAGCAA CTAGCTGGGT AACAGATGGA ACTAAAGTAA 
5751 CAGTTTATGA AGAAAAAGAC GGTTGGTCTA GAATCGGAGA GCTACAGTCT 
5801 GCAATGTGCA TAAATGGAAT AAGTGGCGCG GTCGATTCAG GTGAATCTTC 
5851 CTGCAATCCC ACCAATGGTA TCGTTAATGG GCAATTCGCA CCCTGGGTTT 
5901 TCTCGGATTA TCTTACGATC CAAGAGCCAG AAGCTCCCAT ATCCACCCAA 
5951 GAGTGTCGAA ATATGGGGCT CGAGAACTCA GATAATTACC GTATCTATTC 
6001 TAGTCAGTTC TGCACTGCCG CTCTCGAAAT GATCAACGAT AGAGTATGCA 
6051 ATACATCTGA TTTCAGAGAT TTAGCTTGGT TATCTTCTCC TGAAAGAGGA 
6101 CAGGATTACT ACTTCACCTA TTGTGGCGGA TTTCAACCTC AAAACAGATG 
6151 GTATTTGAAT GTCAGGACAG GTGAAATCAC CCGCTGATAT TCCACCAAGG 

FIGURE 2C 
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6201 

6251 

6301 

6351 

6401 

6451 

6501 

6551 

6601 

6651 

6701 

6751 

6801 

6851 

6901 

6951 

7001 

7051 

7101 

7151 

7201 

7251 

7301 

7351 

7401 

7451 

7501 

7551 

7601 

7651 

7701 

7751 

7801 

7851 

7901 

7951 

8001 

8051 

8101 

8151 

8201 

8251 



TGAGTCCTGT 
ATGAGATCAA 
TTAGGATGAA 
TTTACAAGAG 
CCCAACGGGA 
CCCGTCCCGG 
GGCAGCGATT 
AGATATGGGC 
TGAAAGCTGA 
TGACGTGAAC 
GGTCTGCCGG 
CCGATCAGCA 
GCTGCGATGC 
AGCCGACTAG 
CAGCGGATGA 
AGCCAAACAC 
TATATCTAGA 
GAAGGCAATG 
CCCGACCAGC 
ATTATCCTAC 
GAATTTCATC 
CCTGCGCCGG 
TCACTGAACG 
GCCAATGACG 
GATCACGTCA 
TTGTCGAGCA 
GATCTGGTCC 
TAATGTCAGG 
GGGATCGTAT 
CATGGCATCA 
GGTCTATTAT 
AGTTCGAGCG 
CTGCTGGTCA 
GCGCAATACG 
GAAAGCTGTT 
GAGCGTACCA 
CTTCGTTGCG 
CCTGGCATGT 
CTGCGCGGGC 
GGTTAACAGT 
AGAAAGGTGC 
TTCTACGAGC 



AGATCAGACT CTCAAGGAGT 
CATAGATAGG TGTTCAGTCC 
TCTGTCCGGC TCTTGACATA 
AAAGTCAGCG GCCTCGAAGC 
TGAGTAGAGC GAACAAGCAA 
AAGCTGCAGC CGCAGAATCC 
ATTTGGATAG GACACAATCC 
ATGCGCCGCG CGTGATCCTC 
CTATGCCCTA CGCATCGCGC 
CGCACCTACC TGCTGCCCGC 
AC AGGTC GTT GCGCTGTTCG 
GCTTCATGTT CTGGGCGGTG 
TACAAATTCA TCGAGAATTA 
TGCGGACGGG CGGCAGGTCG 
CCTCACTGTT GATCGGCTTG 
GCGCGCAACA GCAACGCGGC 
CCTGCTTCGG GACCCGGATC 
AGTTCGGAAT CACTTACGGT 
AGCCACAGGC ACCACTGGTT 
AGACGAGCAG CTGGAGGGGT 
ATGGTGTATC GGATTGGGAA 
TTGCACCGCG TCATCTGGAA 
CGACCAGTCG GTTGATCGGG 
GGGGCACGAA ACTGTCGAAG 
AAATGGTC C A GCGGATCGGC 
CATAAACAAA GGTCTCGGTG 
TGAAGGCCTG TCTGGTCGTC 
AACTTTACAA GCGAGGTCAT 
CAAGCAGGCA TTCGAGAACA 
CCGGGGATAA CCTCGGCTCT 
ATCTACAACA CGCCGGATTT 
GGTCAATGCC AGCTCCATGC 
GCGCCTTCGT TGGCCATTCG 
ATCCGCGATC ACCTGCGTGT 
CGATGCCATG GCGAAGGGGG 
TCGAAGAATT GCTGGAAATG 
CTGTCGCTGC TCTATCAGGG 
CGATCATATC ATTCCCCAAG 
GCAATCTGCC CGAGCATCGC 
TTGGGCAACC TGCAACTTTT 
GCTGCCATTC AGGTCATGGA 
AGCATATGAT CCCGGCGCAC 

FIGURE 2D 



AAACGTTTTA 
CGGCATCTGG 
CCCCCGCGTG 
CGCTCTAGCC 
AGCGCCTAGC 
GGAAAAAAGA 
TTTTCTATTA 
ATTCGATACA 
ACCATCGGTG 
AATCCAACGC 
ACTCTCTGTT 
GACGAGGAGA 
TCGGCCCGAA 
TCCTTGTGCT 
CGCGGCACAT 
GGCGTGGTCG 
CGAAGAACTC 
CTCTCTTTCC 
CAAGGTGGGA 
TGATTGCCAA 
AAGGGGCTGG 
AGACGAGGGC 
TGCTGGACAT 
GCAGACCTGC 
CCGCGAGGAA 
CGCCGAACAA 
TGCGATTATG 
CGGCAGGATC 
CGTTCCGCCT 
TTGAACGCGG 
CGATTTCCGA 
ACCTCTGGTT 
GATCAGACCA 
AGGCCGCGAT 
GACGGCTATC 
CAATATGGCA 
CATCGACTGG 
CGGACGCTCA 
ATTCAGGAAA 
GCGCGGAGAT 
TTACCGGACG 
CTTGAACTGT 



ATCCATCTCC 
TGGATCGGGT 
AAACCCTGTC 
GATGCCCGGG 
TATGGCTCTG 
AAAAATACAT 
ATATACAACA 
ATCCAAATCC 
CCGTCATTGA 
CCCTATGTCT 
GAAGGGCTAT 
CCAAGGCAGA 
ATGATGAACG 
CGACGGACAG 
TCTCTGAGAA 
GCAAAAACGC 
CGATGAAGAC 
ATGAACGCCG 
TCGATACTGG 
GGTGAAGACC 
CGGAAGACAC 
ATCAACTTTT 
CTTCGTGCGG 
TGATGTCGAT 
ATCGGCGGCT 
GATGAGTCGC 
ATGTCGTCTA 
GAAAGCAACT 
GCTGAACAGG 
TGCTGCCTCT 
GGATCGAGCG 
GGTGAACAGC 
TCACCACCGC 
TTCCCAGTAC 
TCAGGTGGAT 
AGCCCCGGAC 
AACGGATCGA 
GAAAAATGTG 
TCTTGGGCGC 
GAGAATATCG 
GCGCGTTGAT 
GCGATGTACT 



83 01 GCATCTGCCC GAGTTCGTGC GCGAACGGGA AAAGGTGATC CGGCGCCGTT 

8351 TGATGGAGTT GGTCGGAGCA CGACGCGCAT GAATGAGGTC GTCTTGTCAC 

8401 GCGAAGAGCT GCGTCAATCT TGTCTCGACC TGCTTGAAAA ACGCGCTGTC 

8451 GAACACCCTG CGGGACACCA AGGCAAGCTC GCCGCCCGCT ATGTTGTGCA 

8501 CCGCGACGA 



FIGURE 2E 
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SEQ ID NO: 3 



1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TC TG AC AC AT GCAGCTCCCG 
51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 
101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA 

2 01 CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGGCGCC ATTCGCCATT 
251 CAGGCTGCGC AACTGTTGGG AAGGGCGATC GGTGCGGGCC TCTTC GCTAT 

3 01 TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTT GGGTA 

3 51 ACGCCAGGGT TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGAATT 

4 01 CGGCAATGGG TCGAAATTCA TAGAATTTTG TGTGAGGTGC GTAGCGGCTC 
451 TGACAGGGGT GCTGCGCGGA GATCTCTGGT C TC AGGTAGG GCGACAATGG 
501 AGAGGTGTTA GTTGCCCCCT GTATCGCTCT CTGCGTGGCG CATTGGGTCA 
551 TCCTGCCCGG ACATATGATA TTCCGCTAGA GG ATT AC TG A TAGTTTCTGC 
601 CTGTCGGGCT TGTCGGGCTT GTCGGGCTTG TCGGGCTTGT CGGGCCTGTC 
651 CCTCTTGTCC CGCCTGTCCT CACTTTTTCA CAATCAAAAA ATGGGCGAAG 
701 CCCTTCTTGT TCTATAGTTC TTATAGTTCA TACGAAAATT ACACATAATT 
751 ATCAATAGCT TATTCGCTTA AAAGGGAGTA ATTGGGCCGC AAAAGGGAGT 
801 AATTGGGCCG CAAAAGGGAG TAATTGGGCC GCAAAAGGGA GTAATTGGGC 
851 CGATATCGGT TGTTTACATG GGGAGGAATC CCCTTAATCA TTTCTCCCCA 
901 TGGGAAAGAC AACACAAGTG GCCGCAGACC GGGCCTTCGA CCAGACAAAA 
951 ACTGTGCTCC CTGCCGAGGT GGCGAGAGGG GTCTATATGC GCAATCCGCC 

1001 CCGCCTGCAG GCGCTCAAGC TCATGCATTT AATGATAGCC ACTGCGGGCG 
1051 GCCGCATGGC TGATGATGTG CGCCATGAAA TGCGGCTGGC CGACATTCGC 
1101 GCAATCGACG GCATGAAAAA CCATGACCGT GAGAGCCTGA CCCCGCTGTT 
1151 CGAGGAGCTA GCCGCTGCGG TGTTGACCCA TGATGACCCT GCAAAGATGA 
1201 TCGTGACAGT CGGCGGCTTG GTCGATGAGG CGC GAATAGA CTACCGCCAG 
1251 GAGGCAAGCG GCGAACTCCT AGTGACGTGG ACCTTCCGGA GTACATTCCG 
13 01 TCGTATGGCG GCGGAGTCGA ACCACTGGGC CATTCTCGAC CGTCAAACGG 
1351 TATTCCATCT CGGTAGTAAG TATTCCGTGC TGCTGTTCCA GCACGTCTCT 
1401 AGTCTCGCCA ATCTTGATCG GATGAGCGCG AAAACCTTTA CGGTCCCCGA 
1451 GTTGCGGGCG CTCCTTGGAG TGCCCGAGGG AAAGATGGTT CGTTGGAACG 
1501 ACGTTAACAG ATTTGCTCTC AAACCTGCAC TGGATGAGAT CAACCATTTA 
1551 TCGCGTCTGA C ATTG AC GGC AAAGCCGACC AAGATTGGCC GTAGCGTGGC 
1601 AAGTGTGACT ATAGGCTGGG AAGTGAAAGA CGACCCAACC GTCGCCAGGC 
1651 GCGAGCTGGC GGGTTCCAAG GTCGGTCGAG ATGCTCGTCG CAGAGGGGCA 
1701 GCGGAAACGA TAGCCCCCTC CTTCCCAGAA GCGGGCGGGA TCACCTACAG 
1751 TCCACGTTGG CTGGAGCTGA AACGCTCTGC TGGCAGCAAC AAGGACAACG 
1801 ATCTGATCGC CTCAGACTTC CGGCGTTTCT GTCGGGAGAG AGGCGTGCGT 
1851 CTGGACGCTG CAAACATCGA AAAACTGTTT TTAGATTTCT GCGCAAAGGT 
1901 AGGGAAGGTT TGAGTTTTGA GGTATTTCAC CGCAATAGTG TTAAATGACT 
1951 TTCGTGAAAC GATGTGCAAT ATAGCGGTAA GACTATGAAA TACACGGCTG 
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2001 GACAGGCTGC AAAAGCAACG GGTGTGGCGA CCGCAACCAT CACTCGGGCG 
2 051 CTAAAAAGCG GTAAAATTTC CGGTAAAAAA GATGAATCTG GGGCATGGGT 
2101 TATAGATCCT GCAGAATTGC ACAGAGTGTT TCCTCCCATT TCAAAGAAAT 
2151 ACACCGAAAC ACCTAACACG CAAGTATATG GTAAGCGTGA TGAAACACAT 

22 01 GAAATGACCT CAGAAATCAG CGCATTAGAG CGTGAAGTTC GGACTTTACG 
2251 CGATGCTTTA TCTGATGCCA GGGAGGATCG CGACAAATGG CGCGACATGG 

23 01 CCGAGCGTCT TTCAATTTCA TCACCGATGA GAGAGGAAGA CCGCCCCCCT 
23 51 CAAAAACAAA GATGGTGGAA GATATTCTGA TCCTGGGCTT CAGGAGCCTT 
2401 GCCTTTACTG GCGGAAAAAC GC GAT ATTGA GGCACAGGCC CGCACTTTAG 
2451 AGGCGGAAGC CTATAACGAG TACCAAAACA CTAGAAGCCA GATTGAGGAA 
2501 AATAGGGAAC GTGGGATCCT CTAGAGTCGA CCTGCAGGCA TGCAAGCTTG 
2551 GCGTAATCAT GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC 
2601 AATTCCACAC AACATACGAG CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG 
2651 CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC ACTGCCCGCT 
2701 TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG 
2751 CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCG CTCGGTCTTG 
2801 CCTTGCTCGT CGGTGATGTA CTTCACCAGC TCCGCGAAGT CGCTCTTCTT 
2851 GATGGAGCGC ATGGGGACGT GCTTGGCAAT CACGCGCACC CCCCGGCCGT 
2901 TTTAGCGGCT AAAAAAGTCA TGGCTCTGCC CTCGGGCGGA CCACGCCCAT 

2 951 CATGACCTTG CCAAGCTCGT CCTGCTTCTC TTCGATCTTC GCCAGCAGGG 

3 001 CGAGGATCGT GGCATCACCG AACCGCGCCG TGCGCGGGTC GTCGGTGAGC 
3051 CAGAGTTTCA GCAGGCCGCC CAGGCGGCCC AGGTCGCCAT TGATGCGGGC 
3101 CAGCTCGCGG ACGTGCTCAT AGTCCACGAC GCCCGTGATT TTGTAGCCCT 
3151 GGCCGACGGC CAGCAGGTAG GCCGACAGGC TCATGCCGGC CGCCGCCGCC 
3201 TTTTCCTCAA TCGCTCTTCG TTCGTCTGGA AGGCAGTACA CCTTGATAGG 
3251 TGGGCTGCCC TTCCTGGTTG GCTTGGTTTC ATCAGCCATC CGCTTGCCCT 
33 01 CATCTGTTAC GCCGGCGGTA GCCGGCCAGC CTCGCAGAGC AGGATTCCCG 
3351 TTGAGCACCG CCAGGTGCGA ATAAGGGACA GTGAAGAAGG AACACCCGCT 
3401 CGCGGGTGGG CCTACTTCAC CTATCCTGCC CGGCTGACGC C GTTGG AT AC 
3451 ACCAAGGAAA GTCTACACGA ACCCTTTGGC AAAATCCTGT ATATCGTGCG 
3501 AAAAAGGATG GATATACCGA AAAAATCGCT ATAATGACCC CGAAGCAGGG 
3 551 TTATGCAGCG GAAAAGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT 
3 601 TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 
3 651 CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG 
3701 CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG 
3751 GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 
3801 GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC 
3 851 TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC 
3 901 CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA 
3951 GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC 
4001 GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 
4051 TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 
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4101 GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG 
4151 AAGTGGTGGC CTAACTACGG C TAC ACTAGA AGGACAGTAT TTGGTATCTG 
4201 CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 
4251 CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG 
43 01 CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC 
4351 TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 
4401 TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT GGGGGGGGGG 
4451 GGCGCTGAGG TCTGCCTCGT GAAGAAGGTG TTGCTGACTC ATACCAGGCC 
4501 TGAATCGCCC CATCATCCAG CCAGAAAGTG AGGGAGCCAC GGTTGATGAG 
4551 AGCTTTGTTG TAGGTGGACC AGTTGGTGAT TTTGAACTTT TGCTTTGCCA 
4601 CGGAACGGTC TGCGTTGTCG GGAAGATGCG TGATCTGATC CTTCAACTCA 
4651 GCAAAAGTTC GATTTATTCA ACAAAGCCGC CGTCCCGTCA AGTCAGCGTA 
4701 ATGCTCTGCC AGTGTTACAA CCAATTAACC AATTCTGATT AGAAAAACTC 
4751 ATCGAGCATC AAATGAAACT GCAATTTATT CATATCAGGA TTATCAATAC 
4801 CATATTTTTG AAAAAGCCGT TTCTGTAATG AAGGAGAAAA CTCACCGAGG 
4851 C AGTTC CAT A GGATGGCAAG ATCCTGGTAT CGGTCTGCGA TTCCGACTCG 
4901 TCCAACATCA ATACAACCTA TTAATTTCCC CTCGTCAAAA ATAAGGTTAT 
4951 CAAGTGAGAA ATCACCATGA GTGACGACTG AATCCGGTGA GAATGGCAAA 
5001 AGCTTATGCA TTTCTTTCCA GACTTGTTCA ACAGGCCAGC C ATT AC GC TC 
5051 GTCATCAAAA TCACTCGCAT CAACCAAACC GTTATTCATT CGTGATTGCG 
5101 CCTGAGCGAG ACGAAATACG CGATCGCTGT TAAAAGGACA ATTACAAACA 
5151 GGAATCGAAT GCAACCGGCG CAGGAACACT GCCAGCGCAT CAACAATATT 

52 01 TTCACCTGAA TCAGGATATT CTTCTAATAC CTGGAATGCT GTTTTCCCGG 
5251 GGATCGCAGT GGTGAGTAAC CATGCATCAT CAGGAGTACG GATAAAATGC 

53 01 TTGATGGTCG GAAGAGGCAT AAATTCCGTC AGCCAGTTTA GTCTGACCAT 
5351 CTCATCTGTA ACATCATTGG CAACGCTACC TTTGCCATGT TTCAGAAACA 
5401 ACTCTGGCGC ATCGGGCTTC CCATACAATC GATAGATTGT CGCACCTGAT 
5451 TGCCCGACAT TATCGCGAGC CCATTTATAC CCATATAAAT CAGCATCCAT 
5501 GTTGGAATTT AATCGCGGCC TCGAGCAAGA CGTTTCCCGT TGAATATGGC 
5551 TCATAACACC CCTTGTATTA CTGTTTATGT AAGCAGACAG TTTTATTGTT 
5601 CATGATGATA TATTTTTATC TTGTGCAATG TAACATCAGA GATTTTGAGA 
5651 CACAACGTGG CTTTCCCCCC CCCCCCATTA TTGAAGCATT TATCAGGGTT 
5701 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 
5751 ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG ACGTCTAAGA 
5801 AACCATTATT ATCATGACAT TAACC TATAA AAATAGGCGT ATCACGAGGC 
5851 CCTTTCGTC 
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SEQ ID NO: 4 

1 GGCAATGGGT CGAAATTCAT AGAATTTTGT GTGAGGTGCG TAGCGGCTCT 
51 GACAGGGGTG CTGCGCGGAG ATCTCTGGTC TCAGGTAGGG CGACAATGGA 
101 GAGGTGTTAG TTGCCCCCTG TATCGCTCTC TGCGTGGCGC ATTGGGTCAT 
151 CCTGCCCGGA CATATGATAT TCCGCTAGAG GATTACTGAT AGTTTCTGCC 

2 01 TGTCGGGCTT GTCGGGCTTG TCGGGCTTGT CGGGCTTGTC GGGCCTGTCC 
251 CTCTTGTCCC GCCTGTCCTC ACTTTTTCAC AATCAAAAAA TGGGCGAAGC 

3 01 CCTTCTTGTT CTATAGTTCT TATAGTTCAT ACGAAAATTA CACATAATTA 
351 TCAATAGCTT ATTCGCTTAA AAGGGAGTAA TTGGGCCGCA AAAGGGAGTA 
401 ATTGGGCCGC AAAAGGGAGT AATTGGGCCG CAAAAGGGAG TAATTGGGCC 
451 GATATCGGTT GTTTACATGG GGAGGAATCC CCTTAATCAT TTCTCCCCAT 
501 GGGAAAGACA ACACAAGTGG CCGCAGACCG GGCCTTCGAC CAGACAAAAA 
551 CTGTGCTCCC TGCCGAGGTG GCGAGAGGGG TCTATATGCG CAATCCGCCC 
601 CGCCTGCAGG CGCTCAAGCT CATGCATTTA ATGATAGCCA CTGCGGGCGG 
651 CCGCATGGCT GATGATGTGC GCCATGAAAT GCGGCTGGCC GACATTCGCG 
701 CAATCGACGG CATGAAAAAC CATGACCGTG AGAGCCTGAC CCCGCTGTTC 
751 GAGGAGCTAG CCGCTGCGGT GTTGACCCAT GATGACCCTG CAAAGATGAT 
801 CGTGACAGTC GGCGGCTTGG TCGATGAGGC GCGAATAGAC TACCGCCAGG 
851 AGGCAAGCGG CGAACTCCTA GTGACGTGGA CCTTCCGGAG TACATTCCGT 
901 CGTATGGCGG CGGAGTCGAA CCACTGGGCC ATTCTCGACC GTCAAACGGT 
951 ATTCCATCTC GGTAGTAAGT ATTCCGTGCT GCTGTTCCAG CACGTCTCTA 

1001 GTCTCGCCAA TCTTGATCGG ATGAGCGCGA AAACCTTTAC GGTCCCCGAG 
1051 TTGCGGGCGC TCCTTGGAGT GCCCGAGGGA AAGATGGTTC GTTGGAACGA 
1101 CGTTAACAGA TTTGCTCTCA AACCTGCACT GGATGAGATC AACCATTTAT 
1151 CGCGTCTGAC ATTGACGGCA AAGCCGACCA AGATTGGCCG TAGCGTGGCA 
12 01 AGTGTGACTA T AGGC TGGGA AGTGAAAGAC GACCCAACCG TCGCCAGGCG 
1251 CGAGCTGGCG GGTTCCAAGG TCGGTCGAGA TGCTCGTCGC AGAGGGGCAG 
1301 CGGAAAC GAT AGCCCCCTCC TTCCCAGAAG CGGGCGGGAT CACCTACAGT 
1351 CCACGTTGGC TGGAGCTGAA ACGCTCTGCT GGCAGCAACA AGGACAACGA 
1401 TCTGATCGCC TCAGACTTCC GGCGTTTCTG TCGGGAGAGA GGCGTGCGTC 
1451 TGGACGCTGC AAACATCGAA AAACTGTTTT T AG ATTTC TG CGCAAAGGTA 
1501 GGGAAGGTTT GAGTTTTGAG GTATTTCACC GCAATAGTGT TAAATGACTT 
1551 TCGTGAAACG ATGTGCAATA TAGCGGTAAG ACTATGAAAT ACACGGCTGG 
1601 ACAGGCTGCA AAAGCAACGG GTGTGGCGAC CGCAACCATC ACTCGGGCGC 
1651 TAAAAAGCGG TAAAATTTCC GGTAAAAAAG ATGAATCTGG GGCATGGGTT 
1701 ATAGATCCTG CAGAATTGCA CAGAGTGTTT CCTCCCATTT CAAAGAAATA 
1751 CACCGAAACA CCTAACACGC AAGTATATGG TAAGCGTGAT GAAACACATG 
1801 AAATGACCTC AGAAATCAGC GCATTAGAGC GTGAAGTTCG GACTTTACGC 
1851 GATGCTTTAT CTGATGCCAG GGAGGATCGC GACAAATGGC GCGACATGGC 
1901 CGAGCGTCTT TCAATTTCAT CACCGATGAG AGAGGAAGAC CGCCCCCCTC 
1951 AAAAACAAAG ATGGTGGAAG ATATTCTGAT CCTGGGCTTC AGGAGCCTTG 
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2001 CCTTTAAAAC CTGAATCAGC ATTCTAGCGA TGCTGATAAG AAGTAAATAT 

2051 AGCCACAATA GAGCGGCCAT TTTCCATTCA CATACAGCTC ATCATGTGAT 

2101 CAATATCAAG TATTGATATT CATCAATGGA GAAGAATTTA CATGTATCAC 

2151 AGGATCATCA CAGCATTTGT TTTTGTATTT CTAAGTGCTA ACATAACTAT 

2201 CGCTGGCCCT AAAGAAGATT GTACTATTGC AGTATCTCAC CTTGGGTTTC 

2251 AGACCGATAA TTACAGCTTT GTCGAAGCCG GTTTTTTTGC CAGAGAGAGA 

23 01 CACGTTTTTG ATGGTGTAAT AAACTGCTAC GTATCTCATG ATGGTAACAT 

2351 ACACAGCATC ATCCGGGGCA ACACACCTCT TATGGAAGAT GGATATTATG 

2401 GCCCAGAAGT ACTGGCGGAA AAACGCGATA TTGAGGCACA GGCCCGCACT 

2451 TTAGAGGCGG AAGCC TATAA CGAGTACCAA AACACTAGAA GCCAGATTGA 

2501 GGAAAATAGG GAACGTG 
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